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Abstract 



We prove that on compact Alexandrov spaces with curvature bounded 
below the gradient flow of the Dirichlet energy in the L 2 -space produces 
the same evolution as the gradient flow of the relative entropy in the L 2 - 
Wasserstein space. This means that the heat flow is well defined by either 
one of the two gradient flows. Combining properties of these flows, we are able 
to deduce the Lipschitz continuity of the heat kernel as well as Bakry-Emery 
gradient estimates and the r2-condition. Our identification is established by 
purely metric means, unlike preceding results relying on PDE techniques. 
Our approach generalizes to the case of heat flow with drift. 

1 Introduction 

The heat equation is one of the most important evolutionary PDEs. It is a well 
known fact in modern analysis that such an equation, say in M. n , can be seen as 
the gradient flow of the Dirichlet energy 
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(1.1) 



in the space L 2 (M n ,£ n ). This viewpoint has been extended to the concept of 
Dirichlet form (see e.g. [61 [12]) and it has grown up into a huge research field 
in potential analysis and probability theory. More recently, Jordan, Kinderlehrer 
and Otto [15] understood that the same equation can be seen as the gradient flow 
of the relative entropy 



in the L 2 -Wasserstein space (^2(R n ), W2), where ^O^ 71 ) stands for the space 
of Borel probability measures on M n with finite second moment. This intuition, 
with the further studies of Otto [31], has been one of the fundamental ingredients 
that drove the research in the field of gradient flows in relation with optimal 
transportation problems in the past decade (see, e.g., pQ and |42j). 

The aim of this paper is to carry on the study of the heat flow as gradient 
flow of the two very different functionals, (|1.1|) and (|1.2|) in the two metric spaces 
L 2 (M. n ,C n ) and (^(K 71 ), W2), in a non-smooth setting. The point is the follow- 
ing. On the one hand, it is known that these two gradient flows produce the same 
evolution 'in all the smooth settings', i.e., it has been proved that in R n , in Rie- 
mannian manifolds with Ricci curvature bounded below and in compact Finsler 
manifolds (for the latter see the work of Sturm and the third author [28]), the gra- 
dient flow of the Dirichlet energy with respect to the L 2 -distance coincides with 
the gradient flow of the relative entropy with respect to the Wasserstein distance 
W2 (see also [ID] . [T6] for related work on different kinds of spaces). On the other 
hand, these flows are studied well also in some non-smooth settings. These raise 
the natural question: do these two notions always coincide? 

We remark that a natural abstract setting where one could try to give an 
answer to this question is the one of metric measure spaces with Ricci curvature 
bounded below (defined by Sturm [40] and Lott and Villani [26], see Definition ^. 4|) . 
Indeed, in order to define either one of the two gradient flows, one usually needs 
both a metric and a measure on the space considered. Furthermore, it is known 
that in the Riemannian setting the fact that the heat flow does not lose mass is 
strictly related to the bound from below on the Ricci curvature, so that one has to 
assume some kind of lower Ricci curvature bound also in the non-smooth setting. 

In this paper, we restrict our attention to the case of finite dimensional, com- 
pact Alexandrov spaces (X,d,T~l n ) of curvature bounded below without bound- 
ary, equipped with the Hausdorff measure T~L n . An Alexandrov space of curvature 
bounded from below by k with k G M is a metric space of sectional curvature 
bounded from below by k in the sense of the triangle comparison property (see 
Definition 12 . X |> . As naturally expected and recently shown in [34] and [43], such 
spaces actually have Ricci curvature bounded below in the sense of Lott-Sturm- 
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Villani (see Remark 12 . 5[) . Our following main result is the first one establishing 
the equivalence of the two gradient flows in a genuinely non-smooth setting (see 
Theorem 13.11 for the slightly more general statement): 

Theorem 1.1 Let (X,d,7i n ) be a finite- dimensional compact Alexandrov space 
without boundary. Then for any fo 6 L 2 (X,7i n ) with foH n 6 ZP{X), the gradient 
flow {ft)te[o,oo) of the Dirichlet energy with respect to the 1? -distance gives the gra- 
dient flow {ftT~i n )t&[o,oo) of the relative entropy with respect to the 1? -Wasserstein 
distance Wi, and vice versa. 

On Alexandrov spaces, the structure of the heat flow as gradient flow has already 
been studied in |20j for the Dirichlet energy approach and in [27] by the third 
author (see also p3] and [38] ) for the relative entropy approach. However, up to 
now it was not clear that these two notions coincide. 

We observe that the idea behind our proof of the theorem is completely dif- 
ferent from the ones used in the aforementioned smooth settings. Indeed, all the 
proofs available in the smooth setting had the following structure: 

• one studies the gradient flow of the Dirichlet energy with respect to L 2 and 
writes down the equation that it satisfies; 

• one studies the gradient flow of the relative entropy with respect to W2 and 
writes down the equation that it satisfies; 

• one realizes that the two equations are actually the same and calls into 
play the uniqueness of the solution of the equation to conclude that the two 
gradient flows coincide. 

Our approach, instead, is completely different and in particular does not pass from 
the study of the heat equation in the Alexandrov setting. Read back in M. n our 
proof gives a new and purely metric way to prove the coincidence of these two 
gradient flows. 

After having proved the identification, we can combine them: their interaction 
gives fruitful applications. Among them, the Lipschitz continuity (Theorem l4.4p of 
the heat kernel is derived as a corollary of a Bakry-Emery type gradient estimate 
(Theorem I4.3P , which follows with the aid of a result of the second author [19j . 
The Lipschitz regularity improves the Holder regularity established in [20]. We 
believe that it is curious and worth to be underlined that such Lipschitz property 
immediately follows when one knows the equivalence of the two gradient flows, 
but is not at all trivial if one sticks to either one of the two approaches alone. 
Our Lipschitz regularity enables us to deduce the T2-condition (Theorem 14. 6[) 
from a Lott-Sturm- Villani Ricci curvature bound. The r2-condition as well as 
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the Wasserstein contraction of heat flow and the Bakry-Emery gradient estimate 
for heat semigroup are known to be analytic characterizations of a lower Ricci 
curvature bound (see [371 El E3] ) • We show the equivalence of those three analytic 
conditions even on Alexandrov spaces (with sharp constants, Theorem 14. 8p . 

It should be remarked that our approach easily generalizes to the case of heat 
flow with drift, where the drift is given by the gradient of a semiconvex potential 
V. In other words, the current approach can be used to study the heat flow on 
weighted Alexandrov spaces of the kind (X, d, e~ v n n ) (see the end of Section gj). 

After having completed the work on this paper, we got aware of a paper [Hj 
by Zhang and Zhu where the Lipschitz continuity of the heat kernel has been 
studied, via a completely different argument. In particular, their proof relies on 
the Lipschitz continuity of harmonic functions due to Petrunin [33J. They even 
proceeded their study to [45] and [35] , where they showed a Bochner type formula 
and Li-Yau estimates under their own notion of lower Ricci curvature bound which 
is stronger than the one of Lott-Sturm-Villani. It is not discussed in [441 l45l [35] 
whether their approach generalizes to the case of weighted Alexandrov spaces. 

The organization of the article is as follows. Section [2] is devoted to prelim- 
inaries for Alexandrov geometry and known results on the gradient flow of the 
Dirichlet energy as well as the gradient flow of the relative entropy on compact 
Alexandrov spaces. We prove our main theorem in Section [3l and discuss its 
applications in Section |H 

2 Preliminaries and notations 

2.1 Alexandrov spaces 

We first review the basics of Alexandrov geometry, see [8], [30] and [7] for details. 

Let (X, d) be a metric space. A rectifiable curve 7 : [0, 1] — > X is called a 
geodesic if it is locally length minimizing and parametrized with constant speed. 
(Precisely, for any to E [0, 1], there is e > such that d( r y(s), j(t))/\s— 1\ is constant 
for all s,t E [to — £,to + e]n [0, 1].) If 7 is minimizing between endpoints, then we 
call it a minimal geodesic. We say that (A, d) is a geodesic space if any pair of 
points in X are connected by a minimal geodesic. 

For k E R 3 we denote by M 2 (fc) the simply-connected, two-dimensional space 
form of constant sectional curvature k. Given three points x,y,z E X, with 
d(x,y) + d(y,z) + d(z,x) < 2ir/Vk if k > 0, we can take a comparison triangle 
Axyz C M 2 (/c) such that d(x,y) = d(x,y), d(y,z) = d(y,z) and d(z,x) = d(z,x). 
Such a triangle is unique up to a difference of isometry. 
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Definition 2.1 (Alexandrov spaces) For k G R, a complete geodesic space 
(X, d) is called an Alexandrov space of curvature bounded from below by k if, for 
any three points x,y,zGX (with d(x, y) + d(y, z) + d(z, x) < 2ir/yfk if k > 0) and 
any minimal geodesic 7 : [0, 1] — >■ X from y to z, we have d(x,'j(t)) > d(x,~f(t)) 
for all t G [0,1], where Axyz C M 2 (fc) is a comparison triangle of Axyz and 
7 : [0, 1] — > M 2 (k) is the unique minimal geodesic from y to z. 

Example 2.2 (a) A complete Riemannian manifold is an Alexandrov space of 
curvature bounded from below by k if and only if its sectional curvature is greater 
than or equal to k everywhere. 

(b) If (X, d) is an Alexandrov space of curvature bounded from below by k, 
then the scaled metric space (X, c • d) with c > is an Alexandrov space of 
curvature bounded from below by k/c 2 . 

(c) For a convex domain D in the Euclidean space M. n , the boundary dD 
equipped with the length distance is an Alexandrov space of nonnegative curva- 
ture. 

(d) Let (M, g) be a Riemannian manifold of nonnegative sectional curvature 
and G be a compact group acting on M by isometries. Then the quotient space 
M/G equipped with the quotient metric is an Alexandrov space of nonnegative 
curvature. 

(e) If a sequence of Alexandrov spaces of curvature bounded from below by k 
is convergent with respect to the Gromov-Hausdorff distance, then its limit space 
is again an Alexandrov space of curvature bounded from below by k. 

Fix x G X and let T, x be the set of all unit speed geodesies 7 : [0, 1] — >■ X with 
7(0) = x. For 7,7/ G T> x , thanks to the curvature bound, the joint limit 

Z 3 ;(7,?7) := arccos hm 



■M^o 2st 

exists and is a distance on T> x /~ where 7 ~ n holds if Z x (7, i]) = 0. We define the 
space of directions (T, x , Z x ) at x as the completion of with respect to Z x . 

The tangent cone (K x , d) is the Euclidean cone over Z- x ) , i.e., 

K x :=X X x [0,00)/^ x {0}, 
d((7, «), (»/,*)) == V s2 + t 2 - 2st cos Z x (-y, rf). 

The inner product on K x is defined by ((7, s), (rj, t)) x := st cos ^(7, rf). In Rie- 
mannian manifolds, spaces of directions and tangent cones correspond to unit 
tangent spheres and tangent spaces, respectively. 
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The Hausdorff dimension of X is an integer or infinity. From here on, we 
consider a compact n- dimensional Alexandrov space of curvature bounded from 
below by k without boundary equipped with the n-dimensional Hausdorff measure 
TV 1 (see [8] for the definition of the boundary of Alexandrov spaces). We say 
that x E X is a singular point if K x is not isometric to the Euclidean space R n , 
and denote the set of singular points by Sx- We remark that H n (Sx) = holds 
whereas Sx can be dense in X (see [8], [30]). 

2.2 Dirichlet energy and the associated gradient flow 

We introduce the Sobolev space and the Dirichlet energy following |20j . and will 
see that it coincides with other notions of Sobolev spaces. We begin by dis- 
cussing a C 1 -differentiable structure of the set of regular points X \ Sx estab- 
lished in [8] and |30j . We remark that Perelman extends this to -DC 1 -structure 
(via 'difference of concave functions', see [32]), but the (^-structure is enough for 
considering the Sobolev space. There is a weak C -atlas {(Uj,, V$, 0)}^ e $ in the 
sense that U$ C X is an open set, <fi : — > R n is a bi-Lipschitz embedding, 
Vcj, C Iff/) with U^ g$ Vcf, D X \ Sx, and that the coordinate change (j>2 ° (p^ 1 is 
C 1 on <M^i n n (X \ S x )) if ny fc /fl ([30J Theorem 4.2(1)]). Such 
charts are constructed through the distance function. Precisely, (f> is introduced as 
4>(x) := (d(x , pi) , d(x , P2) , ■ ■ ■ , d(x , p n )) for suitable pi,P2, ... ,p n S X, and then 
is chosen as the set of regular points x such that a minimal geodesic between 
x and pi is unique for all i ([30, Section 3]). It is worth mentioning that, for any 
e > 0, (j) can be (l+e)-bi-Lipschitz by taking smaller (cf. Theorem 10.9.16]). 
We also remark that it is possible to modify 4> by taking an average so as to satisfy 
y<j> = Ufj, ([301 Section 5]), but it is unnecessary for our discussion (just like [3 
Remark 2.9]). 

We say that a function / on X is differentiable at a regular point x € X\ Sx if 
x 6 Vj, and fo<f)~ l is differentiable at 4>(x) for some <f> E 3>. Then we can define the 
gradient vector V f(x) E K x by identifying K x and 1" through <fi. (To be precise, 
by virtue of the first variation formula ( [30] Theorem 3.5]), each di := d(-,pi) is 
differentiable at x with Vdi(x) = —Vi, where Vi E K x is the tangent vector of the 
unique, minimal, unit speed geodesic from x to pi. Then K x is linearly identified 
with W 1 as Yli a i v i = (~ a i)-) Moreover, again due to the first variation formula, 
we obtain the Taylor expansion 

/( 7 (t)) = f(x) + t(Vf(x), 7(0)), + o x (t), (2.1) 

where 7 : [0, 5] — )■ X is a minimal geodesic emanating from x and o x (t) is in- 
dependent of the choice of 7 (see Lemma 3.4], and note that the remainder 
term in the first variation formula for d(-,Pi) at x indeed depends only on x and 
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Pi). Another important fact we will use is the Rademacher theorem, namely a 
Lipschitz function / on X is differentiable H n -&.e.. This easily follows from the 
usual Rademacher theorem for / o (see [5J Corollary 2.14]). It follows from 
(|2,ip that y 7 (Vf(x), Vf(x)) coincides with the local Lipschitz constant |Vl/|(x) 
given by 

y^x d{x,y) 

Based on the notion of gradient vector, we define the Sobolev space and the 
Dirichlet energy as follows (see [20] and |22| for details). For a function / : X — > M 
such that / o <p~ l £ VF 1,2 (0(L^)) for all (ft £ we introduce the weak gradient 
vector V/(x) E K x for a.e. x € as the element corresponding to the weak 
gradient vector V(/ o </> _1 )((/>(x)). We define the Sobolev space W 1,2 {X) and the 
Dirichlet energy £ by 

W l >\X) := {/ G L 2 (X,H n ) JjVf,Vf)dH n < oo\ , 

£(f, g) := / (V/, Vg) dU n for /, g G W^ 2 (X). 
Jx 

We do not divide £ by 2 for notational simplicity. Note that £ coincides with 
the energy functional introduced by Korevaar and Schoen [18] (we can reduce 
the argument to the Euclidean case by using a (1 + e)-bi-Lipschitz chart; see [22~1 
Theorem 6.2]). We also remark that the set of Lipschitz functions C Lip (X) is 
dense in W l > 2 (X) with respect to the Sobolev norm ||/||Li,2 = H/IH2 +£(f,f) 
([201 Theorem 1.1]). 

Furthermore, if / is a Lipschitz function, then the weak gradient vector V/(x) 
coincides with the gradient vector as in (|2.1|) a.e. x, and hence (V/(x), V/(x)) = 
|Vl/|(x) holds a.e. x. Therefore £ also coincides with Cheeger's energy func- 
tional ([9]), because the local Lipschitz constant is the minimal generalized upper 
gradient ([HI Theorem 6.1]) and Lipschitz functions are dense in both Sobolev 
spaces (thanks to the weak Poincare inequality for upper gradients and the vol- 
ume doubling condition, [9l Theorem 4.24]). Indeed, in our framework, the volume 
doubling condition directly follows from the Bishop-Gromov volume comparison 
theorem and the Poincare inequality is a consequence of |21j and [36], for instance. 



By following the general theory of bilinear forms, there exists a nonpositive 
selfadjoint operator (A, 12(A)) on L 2 (X,U n ) associated with (£, W^ 2 {X)). It is 
characterized by the following identity: 

B{g, f) = - [ gAf dH n , f e D(A), g e W^ 2 (X). (2.2) 
Jx 
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We call A the Laplacian as in the classical case. Based on a general theory of 
functional analysis, the one-parameter semigroup of contractive symmetric linear 
operators T t = e* A on L 2 (X,H n ) is defined associated with A. For any / G 
L 2 (X,T~l n ), Ttf solves the (linear) heat equation dtu = Au with u(0, •) = / in the 
sense that T t f G D(A) for t > and 

lim Tt f~ f = Af in L 2 (X, H n ) for / G D(A), 
limit/ = f in L 2 (X,H n ) for / G L 2 {X,7i n ). 

Note that T t is Markovian in the sense that < T t f < 1 holds whenever < / < 1. 
As shown in [201 Theorem 1.5], there exists a continuous function (t,x,y) \— > 
Pt(x,y) on (0, oo) x X x X satisfying the following properties: 

(i) For any / G L 2 (X,n n ), t > and T^-a.e. 

T t f(x)= [ Pt (x,y)f(y)H n (dy). (2.3) 
Jx 

(ii) For any s, t > and x,y £ X, 

p t (x,y) = p t (y,x), (2.4) 
p s+ t(x,y)= / p s (x,z)pt(z,y)'H n (dz), (2.5) 

Pt (x,z)H n {dz) = 1, (2.6) 
Pt(x,y) > 0. 



A' 



See [20, Theorems 1.4, 1-5(3)] for the continuity of Pt(x, y). The equality f|2.6|) 
follows from the fact that 1 G VF 1,2 (X) and 8(1, 1) = 0, because X is assumed to 
be compact. As in the classical case, we call pt(x, y) the heat kernel. The existence 
and these properties of pt(x, y) are deduced from the Poincare inequality for £ and 
the volume doubling condition, together with results in [39] . 

Remark 2.3 The bilinear form {£, W 1,2 (X)) is a symmetric strongly local reg- 
ular Dirichlet form (see [20] for it and further details; see [61 [12] for basics on 
Dirichlet forms). Moreover, Tt enjoys the strong Feller property. As a result, 
there exists a diffusion process ((Xt)t>o, (^ x )xex) on the whole space X associ- 
ated with (£, W 1 ' 2 {X)) in the sense that E x [f(X t )] = T t f{x) for / G C(X), i£l 
and t > 0. 
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Denote by &(X) the set of all Borel probability measures on X. Let us define 
a positive Borel measure Ttv for v G &(X) and t > by 



Thanks to (j2.6|) . G &(X) holds. By definition, Ttv is absolutely continuous 
with respect to U n for t > 0. When dv = fd?i n , it holds dT t v = T t fdU n . In 
this paper, we call the evolution (i, u) i— > Tti/ i/ie gradient flow of the Dirichlet 
energy (since the Dirichlet energy is a functional on the L 2 -space of functions, 
this terminology should be interpreted in an extended sense). Indeed, it is easy 
to see from 



that the Radon-Nikodym derivative dTtv/d'H n is the gradient flow of 8/2 with 
respect to the L 2 -norm. 

Before closing this subsection, we review the derivation property of W 1,2 (X). It 
is formulated as follows: For f u . . . , f k ,g G W l > 2 {X)C\L°°{X, H n ) and $ : R k -> R 
which is C 1 on the range of (f\, . . . , 3>(/i, ■ ■ ■ , fk) belongs to W 1,2 (X) and 



(V*(/i, . . . , f k ), V 5 > = J2 7j-(A> • • • ' (V/j ' V5) ^ n " a ' e - (2 - 7) 



This identity directly follows from the definition of W l,2 {X) or from the strong lo- 
cality of the Dirichlet form (£, W^ 2 (X)) (see [0 Corollary 1.6.1.3] and [HI Section 
3.2] for the latter). 

2.3 Gradient flows in the Wasserstein space 

We next introduce the Wasserstein space and a purely metric notion of gradient 
flows in it. We refer to [1] and |42j for the basic theory as well as the recent diverse 
developments. 

Given fi, v G &(X), a probability measure it G &{X x X) is called a coupling 
of /i and ^ if ix{A x X) = fi(A) and ir(X x A) = v(A) hold for all Borel sets Ac X. 
Then, for 1 < p < oo, we define the L p - Wasserstein distance as 





fe 
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where ir runs over all couplings of p and v. In most parts, we work in the quadratic 
case p = 2. The L 2 -Wasserstein space (^(X), W2) becomes a metric space and 
inherits the compactness from (X,d). Moreover, (^(JT), W2) is a geodesic space. 
If /i is absolutely continuous with respect to 7i n , then a minimal geodesic (pt)te[o,i] 
from /i to any v is unique and pt is also absolutely continuous for all t G (0, 1) (see 
[5] for a more detailed characterization of /if, and [llj for the absolute continuity). 
For p G £P(X), we define the relative entropy by 

Ent(^) := / p\ogpdU n 
Jx 

when p = pU n with p G L X (X, H n ), and Ent{p) := 00 otherwise. Set 0>*(X) := 
{p G &(X) I Ent(p) < 00}. Note that Ent is lower semi-continuous with respect 
to W2 and satisfies Ent(p) > — logH n (X) by Jensen's inequality. 

Definition 2.4 (The curvature-dimension condition) For K £ M, we say 

that (X,d,T-L n ) satisfies the curvature- dimension condition CD(K, 00) if Ent is 
-fT-geodesically convex in the sense that any pair p,u 6 <^(X) admits a minimal 
geodesic (/it)t e [o,i] from p to v such that 

Ent(jiif) < (l-t)Ent(Ai)+tEnt(z/)-y(l-t)tPF 2 (^,i/) 2 

holds for all t G [0,1]. 

We remark that the above inequality is obvious if p ^ &*(X) or ^ ^*(X). 
Therefore it is sufficient to consider /x, v £ ^*(X), and then a minimal geodesic 
between them is unique. 

Remark 2.5 (i) The curvature-dimension condition CD(K, 00) for general metric 
measure spaces is introduced and studied independently in [ID] and [26], and 
known to be equivalent to the lower Ricci curvature bound Ric > K for complete 
Riemannian manifolds equipped with the Riemannian distance and the volume 
measure ([37]). 

(ii) The condition CD(K, N) for N G (l,oo) is also introduced in [41J and [24]. 
In general, CD(K, N) implies CD(K, 00). In the Riemannian case, CD(K,N) is 
equivalent to Ric > K and dim < N. 

(iii) It is recently demonstrated in [34] and [43] that n-dimensional Alexandrov 
spaces of curvature bounded from below by k satisfy CD((n — l)k, n) (and hence 
CD((n — l)k, 00)), as is naturally expected from the relation between the sectional 
and the Ricci curvatures. 
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There is a well established theory on the gradient flow of geodesically convex 
functions as comprehensively discussed in [I]. For later convenience, we recall a 
couple of notions in a general form. We say that a curve (p,t)tei C &{X) on an 
interval I C M is absolutely continuous if there is / G L 1 ^) such that 

W 2 (fH,fJL s )< J S f(r)dr (2.8) 

for all t,s £ I with t < s. Note that absolutely continuous curves are continuous. 
For an absolutely continuous curve (fit)tei, the metric derivative 

u+ := lim — 

im h-^o \h\ 

is well-defined for a.e. t S I ([H Theorem 1.1.2]). Moreover, \fit\ belongs to 
L X (J) and is the minimal function for which (12. 8j) holds. Given a functional £7 : 
&(X) —> MU {+oo}, we consider a gradient flow of E solving 11 fit = — V-E(/xt)" in 
the following sense. For \x G <5^(X) with -E(/x) < oo, we define the local slope as 

|V. E | M :=E — (2 . 9 ) 

If .£7 is iiT-geodesically convex, then we have 

\E(jh) - E(ix a )\ < J \fi r \-\V-E\(fx r )dr (2.10) 

for all t,s £ I with £ < s along any absolutely continuous curve (/it)tei with values 
in £P(X) with E(fi t ) < oo. As a consequence, it holds 

E{m)<E{n s ) + ^J \fi r \ 2 dr + ^J \X7^E\ 2 (p r )dr (2.11) 

for all t < s. From (|2.10p and (|2.11|) . it is natural to give the following definition. 

Definition 2.6 (Gradient flows) Let E : 0>(X) -> MU{+oo} be a ET-geodesically 
convex functional. We say that an absolutely continuous curve (Mt)te[o,oo) i n 
is a gradient flow of 2? provided E(m) < oo for i > and 

E(fH) = E(fi s ) + ^J \(i r \ 2 dr + ^£ \V-E\ 2 (fi r )dr (2.12) 

for all < t < s. The equation fj2. 12f) is called the energy dissipation identity. 
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The existence of such a gradient flow comes from the general theory presented 
in [H Corollary 2.4.11]. Furthermore, as shown in \27\ Theorem 6.2] (see also |14^ 
Theorem 4.2]), the gradient flow produces a contraction semigroup in the sense 
that for any v G &{X) with E(fi) < oo and E(v) < oo, the gradient flows 
(^)t G [o,oo), (^)ie[o,oo) starting from /j,,u satisfy 

W 2 {m, vt) < e~ Kt W 2 {^ v) for all t > 0, (2.13) 

where K is the modulus of convexity of E. In particular, the uniqueness follows 
from considering [i = v. (Though the strategy of the construction in |27| is differ- 
ent from [I], the resulting flow is the same by uniqueness, see [14, Remark 2.7].) 

Remark 2.7 (i) In |38} Theorem 7] the contr activity (from the geodesical con- 
vexity) is shown on spaces satisfying the local angle condition. Alexandrov spaces 
satisfy this condition. 

(ii) The first author [131 Theorem 15] proved the uniqueness of the gradient 
flow of the relative entropy on general metric measure spaces satisfying CD(K, oo), 
without relying on the contractivity. In this generality, the contractivity fails. 
More precisely, the heat flow on a finite-dimensional (Minkowski) normed space is 
not contractive (except for inner product spaces; see [29]). 

In the sequel, we mainly study the gradient flow of the relative entropy. Then 
()2.13p allows us to continuously and uniquely extend the gradient flow semigroup 
to the full &(X) (since the closure of &*(X) is &(X)). Such an extension also 
satisfies (I2.13|) . 

3 Identification of the two gradient flows 

This section contains our main result: 

Theorem 3.1 (Identification of the two gradient flows) Let (X,d) be a com- 
pact n- dimensional Alexandrov space without boundary. For any v £ &(X), the 
gradient flow of the Dirichlet energy starting from v is the gradient flow of the 
relative entropy, and vice versa. 

Recall that CD(if,oo) holds with K = (n — l)k, and we use this, e.g., in the 
proof of Proposition 13.21 below. The main technical obstacle in the proof of this 
theorem is to let the L? and the W 2 structures 'interact'. Our strategy consists 
in picking a gradient flow of the Dirichlet energy and in proving that it obeys the 
energy dissipation identity f|2. 12j) for the relative entropy. We start with a bound 
on the local slope (12. 9p by the Fisher information. 
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Proposition 3.2 (Bound on the local slope) Let fj, = fH n G @>*{X) with 
f G W^ 2 {X). Then it holds 



Jx J 



Proof. We first assume that / is Lipschitz and bounded away from 0. Then we 
know that (V/, V/) = |Vl/| 2 rl n -a.e., so that the conclusion follows from [4*2l 
Theorem 20.1] together with CD(K, oo). Thus all we need to do is to proceed by 
approximation. Suppose that < c < / rl n -a.e. for some c£t. Since C Lip (X) is 
dense in W 1,2 {X) and \ff G W 1,2 (X), we can find a sequence {<?j}jeN of Lipschitz 
functions such that gi converges to \ff as i — > oo with respect to the Sobolev 
norm. Substituting C% max{^, , y/c} for some Cj > if necessary, we can assume 
that yjcl < gi for some q > H n -a.e. as well as [IffiH^a = 1 for all i. Set fi := . 
As /j is Lipschitz and bounded away from 0, we have 



|V_Ent| 2 (/^ n ) < f {Vfh f V/i) dU n . 

JX Ji 



On the one hand, the right-hand side is equal to 4£(gi,gi) and converges to 
A£(\ff, y/J) = J x (V/, V/) / / dW 1 by construction. On the other hand, we deduce 
from [H Corollary 2.4.10] that 

|V_Ent|(,/7r) < lim | V_ Ent | (/^ n ). (3.1) 

i— >oo 

Combining these, we prove the claim for /. It remains to remove the assumption 
that / is bounded away from 0. To do this we just consider f\ := (1 + i~ 1 )~ 1 (f + 
2 _1 ) and apply (|3.ip again. □ 

Let us turn to considering the gradient flow of the Dirichlet energy (Tfu) te i 0iQO y 
For simplicity of notations, we write ft = dTtv/dl~L n in the remainder of the 
section. By (|23|) . f t G C(X) C L 2 (X,H n ) holds for t > 0. Then ([221), (23} and 
(|2.3p imply f t G D(A). For any t > 0, there is > such that /j > e% holds since 
Pt(x, y) is positive and continuous. As a well-known fact, the following bound also 
holds. 

Lemma 3.3 (Maximum principle) Let /o G L 2 (X,'H n ). If fa > c a.e., then 
Ttfo > c a.e. /or every t > 0. T/ie same /iota's /or bounds from above. 

Proof. Take Acl measurable. Since Xj is Markovian, Ttl^ > rl n -a,.e.. Thus 
/ (T t f -c)dH n = [ T t (f -c)dH n = [ T t l A -(f -c)dH n >0. 

J A J A JX 

Since ^4 is arbitrary, the assertion holds. Bounds from above follow by applying 
the same argument to — /q. □ 
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By virtue of Lemma 13.31 with the remark before it, for any e > 0, there are 
c, C > such that 

c < f t <C for all t > e. (3.2) 



Proposition 3.4 (Entropy dissipation) Let v £ tP'(X) and ft = dTtv/dH n . 
Then the function t i— > Ent(/^ n ) is locally Lipschitz in (0, oo) and, moreover, it 
holds 

#Ent(/ t ?0 = - / (V/t ' V/t) cm" a.e. t. 

Proof. As the function e(s) := s log s is C 1 in [c, C] and 4 i— ¥ ft is locally Lipschitz 
in (0,oo) with values in L 2 (X,H n ), we deduce from (|3T2]) that t Ent(/^™) 
is locally Lipschitz in (0, oo). Applying formulas (12, 2\\ and (12. 7p and recalling 
J x A/ t a"H n = 0, we obtain 

^ Ent(/ t 7T) = | e' (ft) Af t dU n = j [ log(/ t ) + l) A/ t oTT 

= -/ (viog(/ t ),v/ t ) m n = - [ (v/t ; v/t) ott. 



□ 



For the next argument, we briefly recall some properties of the Hamilton- Jacobi 
semigroup in our context. For / S C Lip (X) and t > 0, we define Qtf ■ X — > M by 



/(y) 



2t 



(3.3) 



Also, set Q$f := /• The following is shown in [H Theorem 2.5(iv)] and 
Theorem 2.5(viii)] in the framework of general metric measure spaces supporting 
the volume doubling condition and the Poincare inequality for upper gradients. 

Proposition 3.5 (Hamilton-Jacobi semigroup) It holds Qtf G C Lip (X) for 
every t > 0, the map [0, oo) 3 t t-t Qtf € C(X) is Lipschitz in the uniform norm, 
and 

for almost every t, x. 

Remark 3.6 The equation (|3.3|) has been called the Hopf-Lax formula or the 
Moreau-Yosida approximation also in the literature. The former name is mainly 
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used in the PDE context to a special solution to the Hamilton-Jacobi equation 
(|3.4p . The latter one is typically used for an approximation of an (unbounded) 
operator in functional analysis. 

Proposition 3.7 (Absolute continuity with respect to W2) Forv S £P(X), 
the curve t \-t Ttv is absolutely continuous in the space (£^(X), W2) and its metric 
speed \Tfv\ satisfies 

(V/ tj V/ t ) 



\TM < 



x 



ft 



■dU n , for a.e. t. 



(3.5) 



Proof. Fix t,s > 0. By the Kantorovich duality (cf., e.g., [IJ Theorem 6.1.1] and 
\4:2\ Theorem 5.10]) together with (13.3H . we obtain 



-W£ (Ttv,Tt +8 u) = sup 
z <pec Li p(x) 



x 



{Qw)f t+S dn n - / (pftdn 



x 



(3.6) 



By Proposition 13.51 the map r h-> Q r ip from [0,1] to L 2 (X,T-l n ) is Lipschitz. 
Moreover, it is differentiable in L 2 (X, 7-L n ) and the derivative is determined by 
dUD- The curve [0, 1] 3 r ^ f t+rs e L 2 (X,U n ) is Lipschitz as well. Thus [0, 1] 3 
r 1 y (Q r f) ft+rs £ L 1 (X,'H n ) is Lipschitz and its derivative can be calculated with 
the Leibniz rule. Thus we have 



x 



(Qi<p)ft+s*H n 

1 d_ 

„ dr 

1 



Vftd% n 



x 



X 



(QrV) ft+rs dH 

\VlQM 2 



dr 



f t+rs + s{Q r <p)Af t+rs dH n dr. 



(3.7) 



Using formulas ()2.2|) . (j3.2|) and the trivial inequality 



<V 5 , Vg) < ^ (Vg, Vg) + | (Vfif, Vg) 



H n -&.e. for s > 0, 



we have 



(Q r <p)Af t+rs dn r 



X 



(VQ r ^,vf t+rs )dn r 



X 



X 



< 



2s 7x 2 y x 



(V/ t+rs ,V/ tH 
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Plugging this inequality in (|3,7p and recalling that {VQ r tp : VQ r ip) = \VhQrf\ 2 
H n -&.e. (since Q r (p is Lipschitz), we obtain 



/ (Q lV )f t+s dH n - [ <pftdU n <^ I' I {Vft+r f s ' Vft+rs) dH n dr. 

JX Jx 1 Jo Jx Jt+rs 



This bound does not depend on ip, thus from ()3.6f) we deduce 

W$(T t v,T t+a v) <s 2 C I ^±If^±I^dn n dr. (3.8) 

JO JX Jt+rs 

Since we have (|3.2[) and the Dirichlet energy decreases along the flow t h- > f t , we 
obtain 

2 rl r 2 r 

Wi(T t v,T t+s v) < — / / (Vf t+rs ,Vf t+rs )dn n dr<- (V f t ,V f t ) dU n , 
c Jo Jx c Jx 



which gives that the map t i— )■ E ^(X) is locally Lipschitz. The bound (|3.5p 
follows directly from (|3.8p (by recalling the definition of the absolutely continuous 
curves (EED). □ 



Now we are ready to prove Theorem 13 .11 
Proof of Theorem 13.11 By Propositions 13.71 13.21 and 13.41 we have 

^l/^ n | 2 + ^|V_Ent| 2 (/ t ^")< f (V/t ' V/f) dU n = -±Ent(f t H n ) 

Jx h «t 

a.e. t. As the reverse inequality (|2.1ip is always true, equality (|2.12p holds for all 
< t < s. Since (ftT~i n )te[o.oo) 1S continuous also at t = 0, it is the gradient flow 
of the relative entropy. The converse immediately follows from the uniqueness of 
the gradient flow of the relative entropy. □ 



Remark 3.8 (i) When v = foTi n with fo G L 2 (X,H n ), we can give a proof of 
Theorem 13.11 without relying on the positivity improving property ()3.2|) . Indeed, 
by virtue of the contraction property of both flows, it is possible to use Proposi- 
tions 13.41 13.71 only for < c < fo < C and prove the theorem via approximation. 
It suggests that our argument possibly works in a more general framework where 
the existence of the density Pt(x,y) does not follow from the theory of Dirichlet 
forms. In such a case, we could 'construct' pt{x, •) as the gradient flow starting 
from the Dirac measure 6 X E &(X) (via the contraction property). 

(ii) In the proof of Theorem 13. 11 we heavily rely on the fact (V/, V/) = | Vif\ 2 
for / £ C Lip (X) for which we used the local structure of Alexandrov spaces. 
Actually, we use |Vl/| in Propositions 13.21 13.71 and |V/| in Propositions 13.41 13.71 
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4 Applications 



In this section, we assume that (X, d) is a compact Alexandrov space without 
boundary satisfying CD(K, oo), and prove some applications of Theorem 13.11 It 
should be stressed that the 'sectional curvature bound' k in the sense of Alexan- 
drov appears nowhere in the sequel, and the 'Ricci curvature bound' K is essential 
instead (recall Remark 12. 5p . Indeed, those results involving K are natural exten- 
sions of the corresponding ones on a Riemannian manifold with Ric > K. In this 
sense, the emergence of K instead of k is natural and gives sharper estimates. 

Since the gradient flow of the Dirichlet energy T t u is obviously linear and 
symmetric, we immediately obtain the following: 

Theorem 4.1 (Linearity and symmetry) For v G &(X), let (fJ-t)t>0 be the 
gradient flow of the relative entropy on &{X) with [Iq = v. Then the following 
hold true. 

(i) For uq, v x G £?{X), A G [0, 1] and t > 0, 

(ii) For f,ge L\X,H n ) with f,g>0 and ||/|| L i = = 1, 

[ fdtf nn = [ gdii{ H \ 

We remark that the linearity, but not symmetry, also follows from the gradient 
flow approach under the local angle condition ([381 Theorem 8]). In general, these 
properties are completely nontrivial, and the linearity indeed fails in the Finsler 
setting ([28]). 

A new property for (Tt^)t>o,i/£,^'(x) coming from our identification with the 
gradient flow of the relative entropy is the L 2 -Wasserstein contraction (|2.13p . To- 
gether with \19\ Corollary 3.4], we obtain the following: 

Theorem 4.2 (Contraction for the heat flow) For 1 < p < 2, 

W p (T t u , T tVl ) < e- Kt W p (v , fi) (4.1) 
holds for every vq,v\ G &(X). 

Furthermore, by the duality result |19l Theorem 2.2], Theorem 14.21 yields the 
following Bakry-Emery type L -gradient estimate 

\V L T t f\(x) < e-^T t (|V L /| 2 )(x) 1 / 2 (4.2) 
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for any / G C Lip (X) and x € X. By combining (j4.2|) with the regularity of the 
heat kernel, we can extend it to / G P^ 1,2 (X) as follows: 

Theorem 4.3 (Gradient estimate) Let f G W /1 ' 2 (X) and t > 0. T/ien T t f G 
C Lip (X) and 

|V £ T t /|(x) < e- Kt T t (\Vf\ 2 )(x) 1 / 2 (4.3) 
ZioZds /or all x £ X . In particular, 

\VT t f\(x) < e- Kt T t (\Vf\ 2 )(x)^ 2 for a.e. x (4.4) 



and \V L T t f\ < e- Kt ^/\\T t \\ Ll _, Lao £(f,f) hold. 

Proof. Take {fi} ie N C C Lip (X) such that /< -> / in W 1,2 (X) as i -> oo. 
Then (JOD yields r t /< G C Lip (X). Let y G X and 7 : [0, 1] -> X a unit speed 
minimal geodesic from x to y. Since |Vi,Ii/j| is an upper gradient of T t fi (see [U 
Proposition 1.11] for instance), we have 

\TtfM - T t fi(x)\ < f \V L T t fiMs)) ds 
Jo 

<e~ Kt f r t (|VL/i| 2 )(7(a)) 1/2 ds, (4.5) 
J 

where the second inequality follows from (|4.2p . Thanks to f|2.3|) and the bound- 
edness of pt, T t /j converges pointwisely to T t f as i -> 00. Since T t {\ ^Lfi\ 2 ){z) = 
Tt{\^J fi\ 2 ){z) for z G X, Ti(|Vi/i| 2 ) converges pointwisely to T t (\Vf\ 2 ) in a similar 
manner. Thus, by letting i — > 00 in (|4.5p . we obtain 



\T t f(x)-T t f(y)\<e- Kt f T t (|V/| 2 )( 7 (s)) 1/2 ds. (4.6) 

JO 

The boundedness of pt yields that there is C > satisfying 

T t g{z) < C\\g\\ L i {X:nn) (4.7) 

for all g G L 1 (X,'H n ) and z G X. Since Z = d(x,y), the estimate (14. 7H for 
g = |V/| 2 together with (JO)J implies T f / G C Lip (X). In order to show fO|) . 
choose a sequence {yijieN in X so that yj — > x as i — > 00 and 

,. \Ttf(x) - T t f(yi)\ 

llm M S = \ V LT t f\{x). 

«-s>oo d(x,yi) 

By the continuity of p t , T t (|V/| 2 ) G C(X) holds. Thus, applying ([4T6]) for y = y*, 
dividing both sides of it by d(x, y^) and letting i — >■ 00 yield (|4.3|) . □ 
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As an easy but important consequence of Theorem 14,31 we obtain the Lipschitz 
continuity of the heat kernel pt(x,y) as well as that of eigenfunctions. Recall 
that —A has discrete spectrum consisting of nonnegative eigenvalues with finite 
multiplicity ([201 Corollary 1.1]). 

Theorem 4.4 (Lipschitz continuity) (i) For v G £P(X) and t > 0, let ft = 

dTtis/dH n . Then ft G C Lip (X) for t > 0. In particular, we have pt(x,-) G 
C Lip (X) and T t f G C Up (X) for all x G X and f G L l {X,U n ). 

(ii) Let f be an L 2 -eigenf unction of A corresponding to an eigenvalue —A < 0. 
Then f G C U ?(X). Moreover, \V L f\ < e^V^II^ HlUl~ II/IIlW) 
holds for each t > 0. 

Proof Since / t G W 1 ' 2 (X), Theorem EH yields that / t = r t/2 / t/2 G C Lip (X). 
For the second assertion, note that / = e xt Ttf. Then the first assertion and 
£(f,f) = M\f\\L2(x,H n ) yield tlie conclusion - a 



Remark 4.5 (i) It has been known that the heat kernel pt(x, y) is (locally) Holder 
continuous of some fractional exponent, that follows from the parabolic Harnack 
inequality shown in [20J. 

(ii) The existence and the continuity of pt are used in the proof of Theorems 14. 31 
14.41 in an essential way (cf . Remark I3.8[) . 

(iii) To obtain a useful estimate of \ViTtf \ along our argument, we need a 
nice bound for ||Tt||2,i->£°°. For instance, the parabolic Harnack inequality, or the 
Nash inequality, implies 

pt{x > y) - nHBv-ti*)) (48) 

with some constant C > being independent of t, x, y, for small t. It gives a bound 
for ||Ti||£i^£oo. By a general argument, (|4.8p follows from the local Poincare 
inequality and the volume doubling condition, both of which depend only on the 
dimension n of X and a lower curvature bound (see [201 [21~| [36], [39] for instance) . 
However, we should be careful if we want to know whether C in (14. 8|) depends on 
the diameter and/or the volume of X. Indeed, estimates of type (|4.8p are mainly 
studied on non-compact state spaces and hence they did not seem to pay so much 
attentions on such a dependency in the literature. 

In what follows, we consider two additional applications of the Bakry-Emery gra- 
dient estimate (|4,4p by employing the Lipschitz continuity of Ttf. The first one is 
the following inequality: 
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Theorem 4.6 (r 2 -condition) Let f G D(A) with Af G W^ 2 (X). Then, for 
g G D(A) n L°°(X,H n ) with g > and Ag G L°°(X,'K n ), we have 

\ I Ag(Vf,Vf)dn n - [ g{VAf,Vf)dV: n >K [ g (V f,V f) d7i n . (4.9) 
z Jx Jx Jx 

Remark 4.7 (i) By virtue of the analyticity of Tt and Theorem 14.41 we have 
T t f G L>(A m+1 ) and A m T t f G C Lip (X) for any / G L 2 (X,H n ), t > and m > 0. 
Thus there are fairly many / and g's satisfying the condition in Theorem 14.61 
(ii) The inequality fj4.9j) is nothing but a weak form of the ^-condition 

r 2 (/, /) := 1{ A«V/, V/)) - 2 (V/, VA/> } > K (Vf, Vf) . (4.10) 

This inequality is known to be equivalent to (|4.4p in an abstract framework (see 
[3j [23] and references therein, for instance). However, the assumption involves 
the existence of a nice core A C D(A) and it seems hopeless to verify it on 
Alexandrov spaces. When X is a complete Riemannian manifold, the inequality 
(|4.10p is equivalent to Ric > K. Indeed, 

r 2 (/,/) = Ric(V/,V/) + |Hess/| 2 

holds by the Bochner identity 

Proof. We first show the claim for / G D(A) n C Lip (X) with Af G D(A) D 
L°°(X,^ n ). By g2D, we obtain 

/ 5 (VT t f, VT t f) dU n < e~ 2Kt [ gT t ((Vf, Vf)) dU n . (4.11) 
Jx Jx 

The derivation property (|2.7j) yields that, for t > 0, 

/ g(VT t f,VT t f)dU n = [ {V{gT t f),VT t f)dU n - ( T t f {Vg,VT t f) dU n 
Jx Jx Jx 

= - [ gT t f AT t f dU n -\ I (Vg,V{T t f) 2 )dU n 

Jx 2 J x 

= - f gT t fT t Af dU n + )- I Ag(T t f) 2 dU n . (4.12) 
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Hence we obtain 



d F 

- j g(VT t f,VT t f}dH r 

g(Af) 2 dU n - I gfA 2 fdH n 



x 



AgfAfdH n 



x 



X 



(V(gAf),Vf)ctH n + I (V(gf),VAf)dH n - [ (Vg, V(/A/)) dH n 
Jx Jx 

g(VAf,Vf)dn n (4.13) 



x 



by using the derivation property again. Since / G C Lip (X), (Vf, V/) = |Vl/| 2 G 
L°°(X,% n ) holds. Hence we have 

d 



dt 



-2Kt 



gT t ((Vf,Vf))dH n 



d_ 

dt 



x 

2Kt 



t=0 

(T t g) (Vf,Vf)dn n 



x 



t=0 



[ Ag(Vf,Vf)dH n -2K [ g (V /,V /) dH n . (4.14) 
Jx Jx 



Since (I4.1ip implies 

g(VT t f,VT t f)dW 



t=o 



5 s 1 " 



2Kt 



gT t ((Vfyf))dH n 



x 



(4.15) 



t=o 



we obtain (|4T9)) by combining (|4.15p with (|4.13[) and (|4.14p . 

Next we consider the case that / G £>(A) with A/ G W 1 ' 2 (X). Then, by 
the above discussion, T$f and g satisfy (|4.9p for 6 > (cf. Remark I4.7f i)). 
Since #,A# G L co (X, / U a ) and AT^/ = T s Af, it suffices to show the claim that 
lim^o (VT s h,VT s ti} = (Vh,Vti) weakly in L 1 (X,Tl n ) for h,ti G W 1,2 (JQ. By 
polarization, we may assume h = h' . The spectral decomposition yields 



lim £(T s h - h, T s h - h) = 
(5—5-0 



(4.16) 



(see |12| Lemma 1.3.3], for instance). Let ip G L°°(X,'H n ). Then the Schwarz 
inequality yields 



ip (VT s h, VT 5 h) dH r 



^{Vh,Vh)dH n 



x 



< 



X 



^ 2 \VT 5 h\ 2 dK 



1/2 



+ 



x 



^ 2 \Vh\ 2 dW 



1/2^ 



< 2||^||i«»f (h, h) l / 2 £{T s h - h, T 5 h - h) 1 ' 2 . 



£(T s h- h,T s h- h) l/2 
(4.17) 
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Hence the desired claim follows from (|4.16p and ()4.17p . 



□ 



While we proved the implication from (|4.ip with p = 2 to (|4.4p and (|4.9p . these 
conditions are equivalent to each other on complete Riemannian manifolds (see 
[37] ). Such an equivalence still holds in our framework with a sharp constant, 
which can be different from K in our hypothesis CD(K,oo): 

Theorem 4.8 (Equivalence of "Ricci curvature bound" inequalities) Given 
Kq G R ; the following are equivalent. 

(i) flUE} holds for v , v\ G 0>(X) and t > 0, with p = 2 and K = K . 

(ii) g3]) holds for f G W 1 ' 2 (X) and t > with K = K . 

(iii) (|MD holds with K = K for f G D(A) with Af G W l ' 2 {X) and g G 
D(A)nL°°(X,'H n ) with g > and Ag G L°°(X,-H n ). 

Proof, "(i) (ii) =^ (iii)" is already shown in Theorem 14.31 and Theorem 14.61 

For "(iii) => (ii)", it follows from a standard argument of the so-called T2- 
calculus (see [31 [23] for instance). For completeness, we give a sketch of the 
proof. Take go G C(X) with go > arbitrary and let g = T$go- Then g G 
D(A)nL 00 (X,'H n ) with g > and Ag G L°°(X,n n ). Let us define * : [0,t] -»■ M 



By a similar calculation as in ()4.12j) . we can easily prove that ^ is continuous on 
[0, t) and C 1 on (0,t). A similar argument as in (|4.17p yields that ^ is continuous 
at t. Here we use the ultracontractivity ||Tt||2,i_>Loo < oo. A similar calculation 
as in the proof of Theorem 14.31 together with the assumption in (iii) leads to the 
inequality ^f'(s) > 2Ko^f(s). Hence (|4.4|) follows by integrating it. 

For "(ii) => (i)", we claim that (|4.4I) implies (|4.3p for every x G X. Indeed, by 
using a bi-Lipschitz chart, we can bring the problem locally on an open set in a 
Euclidean space. Then, by applying [2J Lemma 3.2.1] and by using the continuity 
of Ti(|V/| 2 ), the claim follows. Then we can apply [19j Theorem 2.2] to conclude 



Remark 4.9 By the same argument as in "(ii) (i)" of the last proof, we can 
give a proof of T t f G C Lip (X) for / G W 1 ' 2 {X) under the condition (|4.4p . In other 
word, a priori regularity Ttf G C Lip (X) by Theorem 14.31 is not used in the last 
proof. 



by 




(i) from (0]> . 



□ 
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As the second application of (|4.4p . we mention that (|4.4p together with Theo- 
rem H3] implies some functional inequalities by means of |17l Theorem 1.3]. Since 
Tt is Markovian, we can restrict Tt to a contraction on L ao {X,ri n ). Then we can 
further extend Tt to a contraction on L p (X,TL n ) for any 1 < p < oo by interpo- 
lation and the symmetry of T t . Let us denote the infinitesimal generator of T t in 
V(X,H n ) by A p . Let us define fl&V := |V((a - A p )-«/ 2 /)|. 

Corollary 4.10 Let 2 < p < oo, q > 1 and a > max{(— X), 0}. TTien we have 
the following: 

(i) There exists a constant Cr > which depends only on p,q and max{(a + 
K), a} such that 

\\R { a q) f\\ LP < C R \\f\\ LP 

forfGL p (X,rl n ). 

(ii) Suppose q < 2. Then there exists C Ptq > such that 

ll|VT t /||| iP < c p jrM\\ lp ^ lp Ui 2 + t-«i 2 ) \\f\\ LP 



fort>0 and f G L p (X,7i n ). 

Proof. It is sufficient to verify the assumption of [17, Theorem 1.3]. Set 
A := C Lip (X) n D(/S.2). Since we already know (|4.4p . we only need to show 
the following claim: A is dense in W 1,2 (X) and f 2 G D(Ai) holds for any 
/ G A By Theorem 031 I*/ G A holds. Thus „4 is dense in W 1 ' 2 ^) since 
£(T t / - f,T t f - /) as t ->■ (cf. gH])). By [6, Proposition 1.2.4.3], it is 
enough to prove / 2 G -D(A 2 ) for / G A Take g G iy 1,2 (X) arbitrary. Recall that 
|V/| 2 = |V L /| 2 G L°°(X,fl n ). The derivation property yields 



|£(/ 2 ,2)l=2 



f{Vf,Vg)dH r 



x 



X 



((Vf,V(gf))-g(Vf,Vf))dH n 



[ (fAf-(Vf\Vf))gdrl n 
Jx 

<2(||/|M|A/|| L2 + |||V/| 2 || i2 )|| 5 || i2 . 



This estimate means f 2 G D(A2) and hence the proof is completed. 



□ 
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Finally, we observe that all of our results are generalized to the heat equation 
with drift, in other words, the Fokker-Planck equation. Given a potential function 
V G C Lip (X), we modify the Dirichlet energy and the relative entropy into 

£ V U,9)= f (Vf,Vg)e- v dU n f, g eW^ 2 (x), 

Jx 

Ent v (n) = Ent(» + / V dfi fiG^(I). 
Jx 

We regard £ clS Si bilinear form on L 2 (X,e- v n n ). Observe that Ent v is nothing 
but the relative entropy with respect to e~ v % n . Note that the semigroup T t on 
L 2 (X,e~ v T-L n ) associated with £ v solves the following diffusion equation 

-^u = An - ( W, Vu) . (4.18) 

Since e~ v is bounded and away from 0, e~ V/ H n is equivalent to 7i n . Hence 
(X, d, e~ v T-L n ) satisfies the volume doubling condition as well as the Poincare in- 
equality for upper gradients. Moreover, as £ and £ are equivalent, the Poincare 
inequality for £ is also valid. Therefore a continuous density p( for exists. 
Under the assumption that Ent^ is K-geodesically convex, we can apply the gen- 
eral theory of the gradient flow on (^(X), W2) to obtain the gradient flow fit of 
Ent . Furthermore, every argument in Sections [3l H] works verbatim and gives 
similar results for the equation (14.181) . Note that, under CD(K, 00) for (X, d, H n ), 
the i^'-convexity of V implies the (K + X')-convexity of Ent^. 
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